Detection of non-stationarity in speech signals and its application to time-scaling

نویسندگان

  • David A. Kapilow
  • Yannis Stylianou
  • Juergen Schroeter
چکیده

This paper describes an automatic method for the detection of non-stationarity in speech signals. It is based on three measures of non-stationarity using Line Spectrum Frequencies (LSFs), the derivative of RMS values, and a combination of these two features. The application of the proposed method to time-scaling of speech signals is also presented. Results from an informal listening test support its usefulness. Following these results, the method seems to be a powerful tool for the automatic control of time-scale factors based on the characteristics of the input speech signal. Listeners preferred our new method over applying a constant time-scale factor in 90% of all cases. Other possible applications of the proposed tool are also discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Synchrosqueezing-based Transform and its Application in Seismic Data Analysis

Seismic waves are non-stationary due to its propagation through the earth. Time-frequency transforms are suitable tools for analyzing non-stationary seismic signals. Spectral decomposition can reveal the non-stationary characteristics which cannot be easily observed in the time or frequency representation alone. Various types of spectral decomposition methods have been introduced by some resear...

متن کامل

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

متن کامل

A Time-Frequency approach for EEG signal segmentation

The record of human brain neural activities, namely electroencephalogram (EEG), is generally known as a non-stationary and nonlinear signal. In many applications, it is useful to divide the EEGs into segments within which the signals can be considered stationary. Combination of empirical mode decomposition (EMD) and Hilbert transform, called Hilbert-Huang transform (HHT), is a new and powerful ...

متن کامل

Non-stationary, filter-stable acoustic objects

In spite of the undisputedly high degree of non-stationarity of speech signals, the present day determination of its acoustic features is based on the assumption that speech production can be described as a linear time invariant (LTI) system on the time scale of about 20 ms [1]. In automatic speech recognition, the wide sense stationarity of an LTI–system is used as prerequisite for the consist...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999